Evaluating Parsing Schemes with Entropy Indicators
Abstract
This paper introduces an objective metric for evaluating a parsing scheme. It is based on Shannon's original work with letter sequences, which can be extended to part-of-speech tag sequences. It is shown that this regular language is an inadequate model for natural language, but a representation is used that models language slightly higher in the Chomsky hierarchy. We show how the entropy of parsed and unparsed sentences can be measured. If the entropy of the parsed sentence is lower, this indicates that some of the structure of the language has been captured. We apply this entropy indicator to support one particular parsing scheme that effects a top-down segmentation. This approach could be used to decompose the parsing task into computationally more tractable subtasks. It also lends itself to the extraction of predicate-argument structure.
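As a rough illustration of the kind of measurement the abstract describes, the sketch below estimates a Shannon-style conditional entropy from n-gram counts over tag sequences. It is not the paper's implementation: the function name `conditional_entropy`, the `<s>` padding symbol, the toy tag set, and the use of `[` and `]` as boundary markers in the "parsed" sequences are all assumptions made for the example.

```python
import math
from collections import Counter


def conditional_entropy(sequences, n=2):
    """Estimate H(X_n | X_1 .. X_{n-1}) in bits from raw n-gram counts.

    `sequences` is an iterable of token lists (for example, part-of-speech
    tags). Each sequence is left-padded with the hypothetical symbol "<s>"
    so that every token has a full (n-1)-token context.
    """
    ngram_counts = Counter()
    context_counts = Counter()
    for seq in sequences:
        padded = ["<s>"] * (n - 1) + list(seq)
        for i in range(len(seq)):
            ngram = tuple(padded[i:i + n])
            ngram_counts[ngram] += 1
            context_counts[ngram[:-1]] += 1

    total = sum(ngram_counts.values())
    entropy = 0.0
    for ngram, count in ngram_counts.items():
        p_joint = count / total                       # P(context, token)
        p_cond = count / context_counts[ngram[:-1]]   # P(token | context)
        entropy -= p_joint * math.log2(p_cond)
    return entropy


# Toy data (invented for illustration, not taken from the paper).
unparsed = [
    ["DET", "NOUN", "VERB", "DET", "NOUN"],
    ["DET", "ADJ", "NOUN", "VERB", "NOUN"],
]
# The same sentences with hypothetical boundary markers around the subject.
parsed = [
    ["[", "DET", "NOUN", "]", "VERB", "DET", "NOUN"],
    ["[", "DET", "ADJ", "NOUN", "]", "VERB", "NOUN"],
]

print("unparsed:", conditional_entropy(unparsed, n=2))
print("parsed:  ", conditional_entropy(parsed, n=2))
```

Whether the bracketed sequences score lower depends on the corpus, the n-gram order, and how the added boundary symbols are accounted for. The abstract does not specify those details, so the comparison above only shows the mechanics of the calculation.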
Similar resources
Towards Entropy-Based Requirements Elicitation
This paper presents our views towards constructing the universal parsing technique for the software requirements texts and the requirements elicitation technique based on the output of this parsing process. With the proposed parsing technique, it should be possible to achieve language-independent processing of the requirements texts. Source sentences are treated as systems with words as element...
Entropy of Hybrid Censoring Schemes
A hybrid censoring scheme is a mixture of type I and type II censoring schemes. When $n$ items are placed on a life test, the experiment terminates under the type I or type II hybrid censoring scheme if either a pre-fixed censoring time $T$ or the $r$th ($1 \le r \le n$ is fixed) failure is first or later observed, respectively. In this paper, we investigate the decomposition of entropy in both hybrid cen...
Measures and Models
I present an entropy measure for evaluating parser performance. The measure is fine-grained, and permits us to evaluate performance at the level of individual phrases. The parsing problem is characterized as statistically approximating the Penn Treebank annotations. I consider a series of models to “calibrate” the measure by determining what scores can be achieved using the most obvious kinds o...
Measures And Models For Phrase Recognition
I present an entropy measure for evaluating parser performance. The measure is fine-grained, and permits us to evaluate performance at the level of individual phrases. The parsing problem is characterized as statistically approximating the Penn Treebank annotations. I consider a series of models to “calibrate” the measure by determining what scores can be achieved using the most obvious kinds o...
Lookahead Search for Lossy Context-Based Adaptive Entropy Coding
In this paper we motivate the need for lookahead search in a context based entropy coder. An efficient algorithm based on modeling of the context coder as a finite state machine is presented. A key contribution of this paper is the use of the per survivor processing principle (PSP) to enable a lookahead search in scenarios where adaptive entropy coding is used. Our results show that lookahead s...
Journal title:
- CoRR
Volume: cmp-lg/9709009
Publication date: 1997